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ABSTRACT 



We show that a regression of unsmoothed peculiar velocity measurements against 
peculiar velocities predicted from a smoothed galaxy density field leads to a biased 
estimate of the cosmological density parameter Q, even when galaxies trace the 
underlying mass distribution and galaxy positions and velocities are known perfectly 
The bias arises because the errors in the predicted velocities are correlated with 
the predicted velocities themselves. We investigate this bias using cosmological 
N-body simulations and analytic arguments. In linear perturbation theory, for cold 
dark matter power spectra and Gaussian or top hat smoothing filters, the bias in 
f2 is always positive, and its magnitude increases with increasing smoothing scale. 
This linear calculation reproduces the N-body results for Gaussian smoothing radii 
R s ^ 10/t Mpc, while non-linear effects lower the bias on smaller smoothing scales, 
and for R s <J 3/i _1 Mpc Q is underestimated rather than overestimated. The net bias 
in f2 for a given smoothing filter depends on the underlying cosmological model. The 
effect on current estimates of £1 from velocity- velocity comparisons is probably small 
relative to other uncertainties, but taking full advantage of the statistical precision of 
future peculiar velocity data sets will require either equal smoothing of the predicted 
and measured velocity fields or careful accounting for the biases discussed here. 



Subject headings: cosmology: theory galaxies: distances and redshifts, methods: 
numerical 
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1. Introduction 

One of the most popular approaches to constraining the mass density parameter Q, the 
ratio of the average matter density to the critical density, is based on comparisons between the 
galaxy density field mapped by redshift surveys and the galaxy peculiar velocity field inferred 
from distance- indicator surveys (see the review by Strauss fc Willick 1995| ). While the numerous 



implementations of this approach differ in many details, they are all motivated by the linear theory 
formula for the peculiar velocity field, 

ff o/(S!) t „_,, (x' - x) j3 



v(x) = ^/*-(x<)^A\ (1) 
or its divergence 

V • v(x) = -a H f(n)S(x) , (2) 

where <5(x) = p(pc)/p — 1 is the mass density contrast, f(Q) ~ O 0,6 , Hq is the Hubble parameter, 
and do is the present value of the expansion factor ( Peebles 1980| ) .0 "Velocity- velocity" comparisons 



start from the observed galaxy density field, predict peculiar velocities via equation ((!]) or some 
non- linear generalization of it, and compare to estimated peculiar velocities (e.g., [Kaiser et al 



19911 [Strauss fc Willick 19951; pavis, Nusser, fe Willick 199ft |Willick et al. 1997| , 1998; |Blakeslee 



et al. 1999 ). "Density-density" comparisons start from the observed radial peculiar velocity field, 
infer the 3-dimensional velocity field using the POTENT method of Bertschinger & Dekel (1989), 
and compare the velocity divergence to the observed galaxy density field usi ng equation ^ o r 



a non-linear generalization of it (e.g., Dekel et al. 1993| ; |Hudson et al. 1995 ; gigad et al. 199S ; 



Dekel et al. 1999|) . Because the radial velocity field must be smoothed before computing the 
3-dimensional velocity field via POTENT, density-density comparisons in practice always compare 
the smoothed galaxy density field to predictions derived from the smoothed peculiar velocity 
field. Velocity-velocity comparisons, on the other hand, usually smooth the galaxy density field 
to suppress non-linear effects and shot noise, but compare the velocity predictions from these 
smoothed density fields directly to the estimated peculiar velocities of individual galaxies or 
groups. (The spherical harmonic analysis of Davis et al. 1996 is an important exception in this 
regard.) 

The avoidance of smoothing the data is often seen as an advantage of the velocity-velocity 
approach, since smoothing a noisy estimated velocity field can introduce statistical biases that are 
difficult to remove. However, in this paper we show that comparing smoothed velocity predictions 
to unsmoothed velocity measurements generally leads to biased estimates of f(£l), even when the 
galaxy positions and velocities are known perfectly. The reason for this bias is fairly simple: the 
errors in the predicted velocities are correlated with the predicted velocities themselves, violating 
the conventional assumption that an individual galaxy's velocity can be modeled as a "large 
scale" contribution predicted from the smoothed density field plus an uncorrelated "small scale" 
contribution. 



2 Because galaxy distances are inferred from their redshifts via Hubble's law, uncertainties in 
Hq and clq do not introduce any uncertainy in peculiar velocity predictions; if one adopts km s _1 
distance units in place of Mpc, then Hq and qq do not appear in equation ([!]) or @j. 
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Galaxy redshift surveys map the galaxy density field S g (x) rather than the mass density 
field 5(x), so inferences from velocity- velocity and density-density comparisons often assume a 
linear relation between galaxy and mass density contrasts, 5 g (x) = 6<5(x), and therefore constrain 
the quantity (3 = /(0)/6 rather than f(Q) itself. The results reported in this paper emerged 
from a more general investigation of the effects of complex galaxy formation models on estimates 



of (3 ( Berlind, Narayanan fc Weinberg 1999; Berlind, Narayanan Sz Weinberg, in preparation ) 



However, the statistical bias in f(£l) that we find applies even when galaxies trace mass exactly, 
so here we focus on this simpler case. (Throughout this paper we use the term "bias" to refer 
to systematic statistical errors rather than the relation between the distributions of galaxies and 
mass.) We further restrict our investigation to the case in which galaxy positions and velocities 
are known perfectly, ignoring the additional complications that arise in analyses of observational 
data. 



2. Results 

We have carried out N-body simulations of three different cosmological models, all 
based on inflation and cold dark matter (CDM). The first is an Q = 1, h = 0.5 model 
{h = .ffo/100 km s _1 Mpc -1 ), with a tilted power spectrum of density fluctuations designed 
to satisfy both COBE and cluster normalization constraints. The cluster constraint requires 



as ~ 0.55 (White, Efstathiou & Prenk 1993), where a% is the rms linear density fluctuation in 
spheres of radius 8/i _1 Mpc. Matching the COBE-DMR constraint and as = 0.55 with h = 0.5 
requires an inflationary spectral index n = 0.803 if one incorporates the standard inflationary 



prediction for gravitational wave contributions to the COBE anisotropies (see Cole et al. 1997 
and references therein). The other two models have f2 = 0.2 and 0.4, with a power spectrum 
shape parameter T = 0.25 (in the parameterization of Efstathiou, Bond &: White 1992|) and 



cluster-normalized fluctuation amplitude as = 0.55S7" 6 . We ran four independent simulations for 
each of the three cosmological models, and the results we show below are averaged over these four 
simulations. All simulations were run with a particle-mesh (PM) N-body code written by C. Park, 



which is described and tested by Park (1990) . Each simulation uses a 400 3 force mesh to follow 
the gravitational evolution of 200 3 particles in a periodic cube 400/i _1 Mpc on a side, starting at 
z = 23 and advancing to z = in 46 steps of equal expansion factor a. We form the mass density 
field by cloud-in-cell (CIC) binning the evolved mass distribution onto a 200 3 grid. We smooth 
this density field with a Gaussian filter of radius R s and derive the linear-theory predicted velocity 
field using equation (|T|). Finally, we linearly interpolate this velocity field to the galaxy positions 
to derive predicted galaxy peculiar velocities v pre d. 

Figure 1 compares the true velocities of particles (v truo ) from one of the f2 = 1 simulations to 
the velocities predicted (v prec i) by equation ([[]) from the mass density field smoothed with Gaussian 
filters of radius R s = 3,5,10, and 15/i _1 Mpc (panels a-d, respectively). The points in Figure 1 
show one Cartesian component of the particles' velocities. If we make the assumption, common 
to most velocity-velocity comparison schemes, that each galaxy's velocity consists of a large 
scale contribution predicted from the density field plus an uncorrelated small scale contribution, 
then the best-fit slope of the v true — v pre( j relation should yield the parameter f(£l), in this case 
f(£l) = 1, with the scatter about this line yielding the dispersion of the small scale contribution. 
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However, it is clear from Figure 1 that this slope increases systematically with increasing R s . (We 
note that the best-fit line, which minimizes |v true — v preG ;| 2 , is shallower than the line one would 
naively draw through these data points by eye, since it is vertical scatter rather than perpendicular 
scatter that must be minimized.) 

The filled points in Figure 2 show the estimated f(£l) as a function of R s for the O = 1 
(circles) and f2 = 0.2 (squares) cosmological models. The solid lines show the true value of /(O). 
In both cases, the estimated value of /(fi) is quite sensitive to the smoothing scale: it is slightly 
underestimated at small scales, but increasingly overestimated at large scales. The £1 = 0.4 model 
yields similar results, so we do not plot it separately. We also investigated = 1 simulations 
with a factor of two lower force resolution (200 3 force mesh instead of 400 3 ) and found identical 
results, so even at small smoothing scales our results are not affected by the simulations' limited 
gravitational resolution. The breakdown of linear theory at small scales is not surprising; however, 
the systematic failure of this method at large smoothing scales has not, to our knowledge, been 
previously discussed. The dependence of the estimated /(H) on the smoothing scale used for 
velocity predictions is our principal result. 

We can understand the origin of the large scale bias in f(£l) by considering the case in which 
galaxy peculiar velocities are given exactly by linear theory. In this case, 

v tmc (x) = (2irf/ 2 H f(n) J e ikx ^dk , (3) 

where 5^ are the Fourier modes of the density field and the integral extends over all of k-space. 
Predicted velocities, however, are estimated from the density field smoothed with a window 
function W(r) of characteristic scale R s . Therefore, 

v pred (x) = (27r) 3 / 2 # /(ft) J W(kR s )e lk - x ^dk , (4) 

where W{kR s ) is the Fourier transform of the window function. The error in the predicted velocity 
of a galaxy at position x is therefore, 

Av(x) = v true - v pred = (2vr) 3 / 2 H f(Q) J[l- W(kR s )}^ l -^dk . (5) 

Note that in equation (£|) we have defined Vp re( j to be the velocity that would be predicted 
assuming the correct value of 0. In practice, since we do not know the value of /(fl) beforehand, 
we derive its value from the slope of the v trU e vs. /~ 1 v pre d relation (this is equivalent to assuming 
£1 = 1 when computing v pre( j). 

If Av were uncorrelated with v pred , then the slope of the vt rue vs. / v pred relation would 
be an unbiased estimator of /(fi). However, if Av is positively correlated with v pred , then the 
slope of the relation is no longer /(fi), since points preferentially scatter above the line for positive 
v pre< j and below the line for negative v prec ;. This steepening of the v true — v pre(1 relation is just 
the behavior seen in Figure 1. Equations (|j) and (||) show that Av and v prec ; will be correlated 
as long as some Fourier modes contribute to both integrals, which happens for any smoothing 
function other than a step function in k-space. 
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We can quantitatively understand this bias by considering how /(fi) is measured. 
For an ensemble of N points (v true j, / _1 v prec i i i), the slope of the best-fit line (assuming 

(Vtrue) = (Vprod) = 0) is 



slope 



E(r 

2~Z(/ ^Vpred,i • Vprgd^) 



jy E[(vtrue,i V pre d,i) ' Vpred, i ~i~ ( Vpred, i ' Vpred, «)] 



1 + 



XX Vpred, i ' V pr ed, 
Av • V pred ) 



( Vpred ' Vp re( j) 



(6) 



Equation @ shows how a non-zero cross-correlation between Av a,ncl Vp re( j changes the measured 
slope of the velocity-velocity relation. We can compute this effect in the linear regime for a given 
power spectrum of density fluctuations P(k) and window function W(kR s ). Using equations (Q) 
and (||) we have 

(Av ■ v pre d) _ / °° W(kR s )[l - W{kR s ))P{k)dk 

(Vpred • V pr ed) f™ W 2 (kR s )P{k)dk 

For Gaussian and top hat window functions and a range of CDM power spectra, we find that 
the bias given by equation (0) is always positive and is always an increasing function of R s . The 
dashed lines in Figure 2 show the slope computed (from eqs. |6] and 0) using the linear mass power 
spectra of the simulations and the same Gaussian window functions that were used to measure 
f(£l) (solid points). The striking similarity on large smoothing scales between the N-body data 
and this linear theory calculation supports our conclusion that the large scale bias is indeed caused 
by the cross-correlation between Av and v pr ed) which, in turn, is caused by the comparison of a 
smoothed prediction to unsmoothed data. 

From equation (|^) it is evident that the linear theory cross-correlation between Av and v pre d 
will be equal to zero if there is no smoothing at all, or if the smoothing function is a step function 
in k-space, in which case the product W(kR s )[l — W(kR s )] is always equal to zero. The open 
symbols in Figure 2 show results of a velocity-velocity analysis of the same simulations, with the 
linear theory velocities now predicted from a density field smoothed with a sharp, low-pass k-space 
filter. Specifically, we set to zero all Fourier modes with k > k cut and plot the new estimates of 
/(O) at the values of R s for which a Gaussian filter falls to half its peak value at k = k cnt (i.e., 
e~ k cut R s/ 2 = 0.5). Using the sharp k-space filter causes the bias to vanish completely on large 
scales, yielding estimates of f(£l) that are correct and independent of smoothing length. This 
result further supports our interpretation of the cause of the large scale velocity-velocity bias. 

Figure 2 shows that f(£l) is underestimated at small scales in the N-body simulations. The 
linear theory bias discussed above and shown by the dashed line in Figure 2 is always positive. 
Therefore, there must be a countervailing effect that biases f(Q) estimates in the opposite direction 
on small scales. In highly non-linear regions of the density field, such as the cores of galaxy 
clusters, linear theory velocity predictions have large errors. However, errors caused by virial 
motions are uncorrelated with the predicted velocities because these virial motions have random 
directions. Such errors add random scatter to the velocity-velocity relation, but they do not change 
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its slope. In mildly non-linear regions of the density field, on the other hand, galaxy velocities still 
follow coherent flows, but these flows may no longer be accurately predicted by linear theory. In 
the case of a galaxy falling towards a large over-density, linear theory will correctly predict the 
direction of motion, but it will overestimate the infall speed because it incorrectly assumes that 
the over-density has grown at the linear theory rate over the history of the universe, while in 
reality the over-density grows to large amplitude only at late times when it becomes non-linear. 
In such regions, Av will be opposite in sign to v pre( i, causing an anti-correlation between the two 
quantities. The opposite happens in under-dense regions, but since fewer galaxies reside in these 
regions and the velocity errors are smaller in magnitude, the net effect is still an anti-correlation 
between Av and v pre d- 

In order to show how these different effects come into play, we adopt a fluid dynamics 
description and divide an individual galaxy's velocity into a mean flow v and a random "thermal" 
velocity <x, so that v true = v + a. Here v(x) is the average velocity of galaxies at spatial position 
x, and therefore (<x • v) = by definition. Let vii n denote the velocity predicted in linear theory 
from the unsmoothed density field (eq. [j]) . Equation (||) applies to the case where the velocity field 
is exactly linear, v true = vi in , but more generally, 

Vtrue = V + a 

= V pred + (Vlin - V pr ed) + (v - Vi in ) + <T , (8) 

and, therefore, 

Av = Vtrue - Vpred = (viin ~ V prc d) + (v - Vi in ) + <T . (9) 

This equation shows the three possible sources of error in the smoothed linear theory prediction 
of galaxy velocities. The first term represents the effect caused by comparing a smoothed quantity 
with an unsmoothed quantity in linear theory and is given by equation (||). The second term 
represents the inadequacy of using a linear theory velocity estimator in regions where non-linear 
effects are important. The third term represents errors caused by galaxies' random thermal 
motions. 

As shown in equation @, the bias in f(Q) depends on the cross-correlation of these errors 

With Vpred, 

(Av • Vpred) = ((Vlin - V pre d) ' V pre d) + ((v - V un ) • V pr cd) + (<T • V prcd ) . (10) 

The first term is positive and causes an overestimate of f(£l) for nearly all smoothing functions. 
Our calculation of this effect via equation (^) shows that it is zero for no smoothing and increases 
monotonically with smoothing scale. We have argued above that the second term is generally 
negative and causes an underestimate of f(£l). Since this effect arises from the non-linearity in 
the density field, it should dominate on small scales and vanish with increased smoothing of the 
density field. Finally, the third term is equal to zero because the thermal velocities have random 



directions. A combination of the first two terms of equation (10) explains the scale dependence 
of /(fi) estimates in Figure 2. For large smoothing of the density field, the first term dominates 
and we overestimate /(O), whereas for small smoothing the second term dominates and we 
underestimate /(fi). The estimate of f(£l) is unbiased at the smoothing scale where these two 
effects cancel, but this scale should itself depend on the specifics of the underlying cosmological 
model. The numerical results in Figure 2 confirm this prediction: the /(O) estimate is unbiased 
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at R s = 5/i Mpc in the Q = 0.2 model (with a$ = 1.44; squares) and at R s = 4/i _1 Mpc in the 
Q = 1 model (with erg = 0.55; circles). The smoothing scale for unbiased estimates could also 
depend on the assumed relation between galaxies and mass, a point we will investigate in future 
work. It is therefore, not possible to remove this bias simply by choosing the right smoothing scale 
in a model-independent way. 

If we had adopted a higher-order perturbative expansion for predicting velocities from 
the smoothed density field, then equation ([H]) would still hold with vi in replaced by v per , the 
perturbative prediction in the absence of smoothing. The first term on the right hand side would 
still be positive, since some Fourier modes would contribute to both (v per — v pre( i) and v prec i. The 
second term could be positive or negative depending on the approximation and the smoothing 
scale. However, while a higher-order approximation might reduce the magnitude of the second 
term relative to the linear approximation, it would not necessarily reduce the net bias in /(O), 
since this depends on the relative magnitude and sign of the first two terms. 



3. Discussion 

The implications of our results for existing estimates of /(O) (or, more generally, of /3) are 
probably limited. As already mentioned, density-density comparisons via POTENT are not 
influenced by the effects discussed here, because they compare density and velocity divergence 
fields smoothed at the same scale. The analysis of Davis et al. (1996), a mode-by-mode comparison 
of density and velocity fields, is also not affected, since the two fields are again compared at 
the same effective "smoothing" . If the observed velocities are unsmoothed, a comparison in 
which velocities are predicted using a truncated spherical harmonic expansion of the density 



field (e.g., Blakeslee et al. 1999) may behave rather like our sharp k-space filter analysis (open 
symbols in Figure 2), since for a Gaussian field the different spherical harmonic components are 
statistically uncorrelated (A. Nusser, private communication; |Fisher et al. 1995 ). Among recent 



velocity-velocity studies, our procedure here is closest to the VELMOD analyses of Willick et 
al. (1997) and Willick & Strauss (1998), who used a 3/i _1 Mpc Gaussian filter to compute the 
predicted velocity field. These authors chose their smoothing scale partly on the basis of tests on 
N-body mock catalogs, and our results in Figure 2 suggest that biases in /(f2) should indeed be 
small for this smoothing. However, we have shown that the disappearance of the bias in /(fi) at 
this smoothing scale occurs because of a cancellation between positive and negative biases, and 
that the scale at which this cancellation occurs depends at least to some degree on the underlying 
cosmological model. As improvements in observational data reduce the statistical uncertainties in 
peculiar velocity data, control of the systematic uncertainties that arise from comparing smoothed 
velocity predictions to unsmoothed data will become essential to obtaining robust estimates of the 
density parameter. 
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Fig. 1. — True velocities of particles from an f2 = 1, CDM, cosmological N-body simulation 
compared to the velocities predicted by linear theory from the mass density field after it is smoothed 
with Gaussian filters of radius R s = 3, 5, 10 and 15/i -1 Mpc (panels a,b,c and d, respectively). The 
points show one Cartesian component of the particles' velocities. Solid lines show the relation 

v true = v pred- 
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Fig. 2. — Estimates of f(£l) from the slope of the relation between true galaxy velocities and 
velocities predicted by linear theory from the smoothed density field, as a function of the smoothing 
radius R s , for CDM models with = 1 and 0.2 (circles and squares, respectively). Points represent 
the mean result of four simulations of each model, and error bars show the uncertainty in the mean 
derived from the dispersion among the simulations. Filled symbols show the estimated /(f2) when 
the density field is smoothed with a Gaussian filter of radius R s . Open symbols show the estimated 
f(£l) when the density field is smoothed with a sharp low-pass k-space filter (with a cut at k cn t), 
where R s is the radius of a Gaussian filter that falls to half its peak value at k = k CVL t- Dashed 
lines show the linear theory prediction of the bias in the estimates of f(fl) (eqs. || and 0) from 
comparing smoothed velocity predictions to unsmoothed velocity measurements. Solid lines show 
the true values of /(f2). 



